VUDENC: Vulnerability Detection with Deep Learning on a Natural Codebase for Python

نویسندگان

چکیده

Identifying potential vulnerable code is important to improve the security of our software systems. However, manual detection vulnerabilities requires expert knowledge and time-consuming, must be supported by automated techniques. Such vulnerability techniques should achieve a high accuracy, point developers directly fragments, scale real-world software, generalize across boundaries specific project, require no or only moderate setup configuration effort. In this article, we present Vudenc (Vulnerability Detection with Deep Learning on Natural Codebase), deep learning-based tool that automatically learns features from large Python codebase. applies word2vec model identify semantically similar tokens provide vector representation. A network long-short-term memory cells (LSTM) then used classify token sequences at fine-grained level, highlight areas in source are likely contain vulnerabilities, confidence levels for its predictions. To evaluate Vudenc, 1,009 vulnerability-fixing commits different GitHub repositories seven types (SQL injection, XSS, Command XSRF, Remote execution, Path disclosure, Open redirect) training. experimental evaluation, achieves recall 78%–87%, precision 82%–96%, an F1 score 80%–90%. Vudenc’s code, datasets corpus available reproduction. Our results suggest capable outperforming most competitors terms vulnerably capabilities software. Comparable accuracy was achieved synthetic benchmarks, within single projects, much coarser level granularity such as entire files.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Melanoma detection with a deep learning model

Background: Skin cancer is one of the most common forms of cancer in the world and melanoma is the deadliest type of skin cancer. Both melanoma and melanocytic nevi begin in melanocytes (cells that produce melanin). However, melanocytic nevi are benign whereas melanoma is malignant. This work proposes a deep learning model for classification of these two lesions.    Methods: In this analytic s...

متن کامل

VulDeePecker: A Deep Learning-Based System for Vulnerability Detection

The automatic detection of software vulnerabilities is an important research problem. However, existing solutions to this problem rely on human experts to define features and often miss many vulnerabilities (i.e., incurring high false negative rate). In this paper, we initiate the study of using deep learning-based vulnerability detection to relieve human experts from the tedious and subjective...

متن کامل

Vulnerability of Deep Learning

The Renormalisation Group (RG) provides a framework in which it is possible to assess whether a deep-learning network is sensitive to small changes in the input data and hence prone to error, or susceptible to adversarial attack. Distinct classification outputs are associated with different RG fixed points and sensitivity to small changes in the input data is due to the presence of relevant ope...

متن کامل

A Framework for Distributed Deep Learning Layer Design in Python

In this paper, a framework for testing Deep Neural Network (DNN) design in Python is presented. First, big data, machine learning (ML), and Artificial Neural Networks (ANNs) are discussed to familiarize the reader with the importance of such a system. Next, the benefits and detriments of implementing such a system in Python are presented. Lastly, the specifics of the system are explained, and s...

متن کامل

Automated software vulnerability detection with machine learning

Thousands of security vulnerabilities are discovered in production software each year, either reported publicly to the Common Vulnerabilities and Exposures database or discovered internally in proprietary code. Vulnerabilities often manifest themselves in subtle ways that are not obvious to code reviewers or the developers themselves. With the wealth of open source code available for analysis, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Information & Software Technology

سال: 2022

ISSN: ['0950-5849', '1873-6025']

DOI: https://doi.org/10.1016/j.infsof.2021.106809